ABSTRACT

The jackknife estimator of the asymptotic variance of a two-sample linear rank statistic is
shown to be strongly consistent. Statistical applications of the result are discussed. The
technique used in proving the consistency of the jackknife variance estimator can be applied
to general situations.
1. Introduction and the main result

Consider the following test problem concerning two (not necessarily continuous) population
distributions $F$ and $G$:

$$H_0: F = G \quad \text{vs.} \quad H_1: F \neq G. \tag{1.1}$$

Let $\{X_1, \ldots, X_m\}$ and $\{Y_1, \ldots, Y_n\}$ be independent samples from $F$ and $G$,
respectively. For simplicity, we assume that $m = n$. The results obtained in the following can
be extended to the case $n/m \to \lambda$, $0 < \lambda < 1$, with some modifications. The
statistic for the test problem (1.1) is the following two-sample simple linear rank statistic
(see, e.g., Hájek and Šidák, 1967; Huber, 1981):

$$S(F_n, G_n) = \int J[\tfrac12 F_n(x) + \tfrac12 G_n(x)]\, dF_n(x), \tag{1.2}$$

where $F_n$ and $G_n$ are the empirical distribution functions corresponding to the samples
$\{X_1, \ldots, X_n\}$ and $\{Y_1, \ldots, Y_n\}$, respectively, and $J$ is a score function
satisfying $J(1-t) = -J(t)$, $t \in [0,1]$. Let $H = \tfrac12 F + \tfrac12 G$ and
$H_n = \tfrac12 F_n + \tfrac12 G_n$. $S(F_n, G_n)$ can be used as a point estimator of the
quantity

$$S(F, G) = \int J[H(x)]\, dF(x).$$

We assume that $S(F,G) = 0$ under the null hypothesis $H_0$ (which is satisfied if $F$ is
symmetric or $F$ is continuous). Thus, we reject $H_0$ if $|S(F_n, G_n)|$ is large.
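The statistic in (1.2) is an average of scores, because integrating against the empirical distribution of the first sample puts mass $1/n$ on each observation. The following is a minimal sketch of that computation for equal sample sizes. The function names `empirical_cdf`, `rank_statistic` and `wilcoxon_score` are hypothetical and chosen only for this illustration.

```python
from bisect import bisect_right


def empirical_cdf(sample):
    """Return t -> (number of observations <= t) / n for the given sample."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda t: bisect_right(ordered, t) / n


def rank_statistic(x, y, score):
    """Two-sample linear rank statistic (1.2) for samples of equal size.

    Integrating against the empirical cdf of x amounts to averaging the
    score of H_n = (F_n + G_n) / 2 over the observations in x.
    """
    if len(x) != len(y):
        raise ValueError("this sketch assumes equal sample sizes (m = n)")
    f_n = empirical_cdf(x)
    g_n = empirical_cdf(y)
    return sum(score(0.5 * f_n(v) + 0.5 * g_n(v)) for v in x) / len(x)


def wilcoxon_score(t):
    """Score J(t) = t - 1/2, which satisfies J(1 - t) = -J(t)."""
    return t - 0.5
```

For instance, `rank_statistic([1, 2], [3, 4], wilcoxon_score)` averages the scores of $H_n(1) = 0.25$ and $H_n(2) = 0.5$. Ties are handled through the "less than or equal" convention of the empirical distribution function, so the sketch does not require continuous data.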

An asymptotic analysis of the sampling distribution of $S(F_n, G_n)$ is needed for obtaining
the critical value for the test problem (1.1) and for calculating the power of the test
procedure. Chernoff and Savage (1958) showed that under certain conditions (see also Hájek and
Šidák, 1967, pp. 233-237), $(2n)^{1/2}[S(F_n,G_n) - S(F,G)]$ converges in distribution to
$N(0, \sigma^2)$ with

$$\sigma^2 = \mathrm{Var}\,\phi(X_1) + \mathrm{Var}\,\psi(Y_1), \tag{1.3}$$

where

$$\phi(x) = \tfrac12 \int J'[H(y)][I_{(x \le y)} - F(y)]\, dF(y) + J[H(x)] - \int J[H(y)]\, dF(y),$$

$$\psi(x) = \tfrac12 \int J'[H(y)][I_{(x \le y)} - G(y)]\, dF(y), \tag{1.4}$$

$I_A$ is the indicator function of the set $A$ and $J'$ is the derivative of $J$. Note that
$\phi(x)$ and $\psi(x)$ in (1.4) are the influence functions of $S(F,G)$ obtained from a
statistical functional approach (see Hampel, 1974; Huber, 1981).

Suppose that we have a consistent estimator $s^2$ of $\sigma^2$ given in (1.3), i.e.,
$s^2 \to \sigma^2$ a.s. Then

$$(2n)^{1/2}[S(F_n,G_n) - S(F,G)]/s \to N(0,1) \quad \text{in distribution.}$$

Hence a test procedure with approximate level $\alpha$ ($0 < \alpha < \tfrac12$) concludes
$H_1$ if

$$(2n)^{1/2}|S(F_n,G_n)|/s > \Phi^{-1}(1 - \alpha/2), \tag{1.5}$$

where $\Phi$ is the distribution function of $N(0,1)$. (1.5) gives the critical region of the
test for (1.1).

In this note we prove that an estimator of $\sigma^2$ obtained by using the jackknife method
(Tukey, 1958) is strongly consistent and therefore can be used in the above test procedure.

For $i = 1, \ldots, n$, let $F_{ni}$ and $G_{ni}$ be the empirical distribution functions
corresponding to the samples $\{X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_n\}$ and
$\{Y_1, \ldots, Y_{i-1}, Y_{i+1}, \ldots, Y_n\}$, respectively, and
$H_{ni} = \tfrac12 F_{ni} + \tfrac12 G_{ni}$. Let $S(F_{ni}, G_{ni})$ be defined as in (1.2)
with $F_n$ and $G_n$ replaced by $F_{ni}$ and $G_{ni}$. The jackknife estimator of $\sigma^2$
is defined to be

$$s_J^2 = (n-1) \sum_{i=1}^n [S(F_{ni}, G_{ni}) - S(F_n, G_n)]^2.$$
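The jackknife estimator recomputes the statistic $n$ times, each time deleting the $i$-th observation from both samples, and sums the squared deviations from the full-sample value. A minimal, self-contained sketch of this recipe follows. The names `rank_statistic`, `jackknife_variance` and `wilcoxon_score` are hypothetical, and the quadratic-time loop is chosen for clarity rather than speed.

```python
from bisect import bisect_right


def rank_statistic(x, y, score):
    """Average of score(H_n(X_i)) over the first sample, H_n = (F_n + G_n) / 2."""
    xs, ys = sorted(x), sorted(y)
    n = len(xs)
    total = 0.0
    for v in x:
        h = 0.5 * bisect_right(xs, v) / n + 0.5 * bisect_right(ys, v) / n
        total += score(h)
    return total / n


def jackknife_variance(x, y, score):
    """(n - 1) * sum_i [S(F_ni, G_ni) - S(F_n, G_n)]^2, deleting pair i each time."""
    x, y = list(x), list(y)
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("this sketch assumes equal sample sizes n >= 2")
    n = len(x)
    full = rank_statistic(x, y, score)
    squared_deviations = 0.0
    for i in range(n):
        # Leave out X_i and Y_i together, as in the definition of F_ni and G_ni.
        reduced = rank_statistic(x[:i] + x[i + 1:], y[:i] + y[i + 1:], score)
        squared_deviations += (reduced - full) ** 2
    return (n - 1) * squared_deviations


def wilcoxon_score(t):
    """Score J(t) = t - 1/2."""
    return t - 0.5
```

Note that the deviations are centered at the full-sample statistic $S(F_n, G_n)$, as in the definition above, rather than at the average of the leave-one-out values.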
We shall assume the following condition.

Condition A. $J'$ is continuous on $[0,1]$ and $\|J'\|_V$ is finite, where $\|\cdot\|_V$ is
the total variation norm (see Natanson, 1961).

Note that $J'$ satisfies $J'(1-t) = J'(t)$ for $t \in [0,1]$. Hence the condition
$\|J'\|_V < \infty$ is satisfied if $J'$ is monotone on $[0, \tfrac12]$. If $J''$ exists, then
$\|J'\|_V = \int_0^1 |J''(t)|\, dt$ and therefore $\|J'\|_V < \infty$ if $J''$ is integrable.
An example of $J$ satisfying condition A is $J(t) = t - \tfrac12$ (corresponding to the
Wilcoxon statistic).
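As an illustration of the quantities involved, the influence functions can be worked out in closed form for the Wilcoxon score. The derivation below is a sketch under the added assumptions that $F = G$ and $F$ is continuous, so that $F(X_1)$ is uniformly distributed on $[0,1]$. It uses the definitions of $\phi$ and $\psi$ with $J(t) = t - \tfrac12$ and $J' \equiv 1$.

```latex
% Assumptions for this illustration: F = G, F continuous, J(t) = t - 1/2.
\begin{align*}
\int I_{(x \le y)}\, dF(y) &= 1 - F(x), \qquad \int F(y)\, dF(y) = \tfrac12, \\
\phi(x) &= \tfrac12\bigl[\tfrac12 - F(x)\bigr] + \bigl[F(x) - \tfrac12\bigr]
         = \tfrac12\bigl[F(x) - \tfrac12\bigr], \\
\psi(x) &= \tfrac12\bigl[\tfrac12 - F(x)\bigr], \\
\mathrm{Var}\,\phi(X_1) &= \mathrm{Var}\,\psi(Y_1) = \tfrac14 \cdot \tfrac1{12} = \tfrac1{48}, \\
\mathrm{Var}\,\phi(X_1) + \mathrm{Var}\,\psi(Y_1) &= \tfrac1{24}.
\end{align*}
```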

The following is our main result.

Theorem. Assume condition A. Then the jackknife estimator is strongly consistent, i.e.,

$$s_J^2 \to \sigma^2 \quad \text{a.s.}$$
2. Proof of the theorem

Let $\phi_n(x)$ and $\psi_n(x)$ be defined as in (1.4) with $F$, $G$ and $H$ replaced by
$F_n$, $G_n$ and $H_n$, respectively. We prove the following result first.

Lemma. Assume condition A. Then

$$\|\phi_n - \phi\|_\infty \to 0 \ \text{a.s.} \quad \text{and} \quad \|\psi_n - \psi\|_\infty \to 0 \ \text{a.s.}, \tag{3.1}$$

where $\|\cdot\|_\infty$ is the sup norm.

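Proof. Under condition A, $\|J'\|_\infty < \infty$. From $\|F_n - F\|_\infty \to 0$ a.s. and
$\|G_n - G\|_\infty \to 0$ a.s. (the Glivenko-Cantelli theorem),

$$|J[H_n(x)] - J[H(x)]| \le \|J'\|_\infty \|H_n - H\|_\infty \to 0 \quad \text{a.s.}$$

and

$$\int |J[H_n(x)] - J[H(x)]|\, dF_n(x) \le \|J'\|_\infty \|H_n - H\|_\infty \to 0 \quad \text{a.s.}$$

From the strong law of large numbers (SLLN),

$$\int J[H(x)]\, d[F_n(x) - F(x)] \to 0 \quad \text{a.s.}$$

Hence

$$\Bigl\| \Bigl\{J[H_n] - \int J[H_n(x)]\, dF_n(x)\Bigr\} - \Bigl\{J[H] - \int J[H(x)]\, dF(x)\Bigr\} \Bigr\|_\infty \to 0 \quad \text{a.s.}$$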


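For the first assertion in (3.1), it remains to show that

$$\sup_x \Bigl| \int J'[H_n(y)][I_{(x \le y)} - F_n(y)]\, dF_n(y) - \int J'[H(y)][I_{(x \le y)} - F(y)]\, dF(y) \Bigr| \to 0 \quad \text{a.s.} \tag{3.2}$$

The quantity in (3.2) is bounded by

$$\Bigl|\int J'[H_n(y)][F(y) - F_n(y)]\, dF_n(y)\Bigr| + \sup_x \Bigl|\int J'[H(y)][I_{(x \le y)} - F(y)]\, d[F_n(y) - F(y)]\Bigr|$$

$$+ \sup_x \Bigl|\int \{J'[H_n(y)] - J'[H(y)]\}[I_{(x \le y)} - F(y)]\, dF_n(y)\Bigr|. \tag{3.3}$$

The first term in (3.3) is bounded by $\|J'\|_\infty \|F_n - F\|_\infty \to 0$ a.s. The third
term in (3.3) is bounded by $\|J'[H_n] - J'[H]\|_\infty$, which $\to 0$ a.s. since $J'$ is
continuous, hence uniformly continuous, on $[0,1]$ and $\|H_n - H\|_\infty \to 0$ a.s. From the
SLLN, $\int J'[H(y)] F(y)\, d[F_n(y) - F(y)] \to 0$ a.s. Hence (3.2) follows from

$$\sup_x \Bigl|\int J'[H(y)]\, I_{(x \le y)}\, d[F_n(y) - F(y)]\Bigr| \to 0 \quad \text{a.s.} \tag{3.4}$$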


Let $I_x(y) = I_{(x \le y)}$ and $g_x(y) = J'[H(y)]\, I_x(y)$. From Natanson (1961, p. 232),

$$\Bigl|\int J'[H(y)]\, I_{(x \le y)}\, d[F_n(y) - F(y)]\Bigr| \le \|g_x\|_V \|F_n - F\|_\infty.$$

Note that $\|g_x\|_V \le \|J'\|_V \|I_x\|_\infty + \|J'\|_\infty \|I_x\|_V \le \|J'\|_V + \|J'\|_\infty$.
Hence (3.4) holds and the first assertion follows. The proof of the second assertion is
similar. □


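Proof of Theorem. Let

$$V_{ni} = \int \phi(x)\, d[F_{ni}(x) - F_n(x)] + \int \psi(x)\, d[G_{ni}(x) - G_n(x)],$$

$$U_{ni} = \int [\phi_n(x) - \phi(x)]\, d[F_{ni}(x) - F_n(x)] + \int [\psi_n(x) - \psi(x)]\, d[G_{ni}(x) - G_n(x)],$$

$$W_{ni} = \int \{J[H_{ni}(x)] - J[H_n(x)]\}\, dF_{ni}(x) - \int J'[H_n(x)][H_{ni}(x) - H_n(x)]\, dF_n(x)$$

and $R_{ni} = U_{ni} + W_{ni}$. Then

$$S(F_{ni}, G_{ni}) - S(F_n, G_n) = V_{ni} + R_{ni}$$

and therefore

$$s_J^2 = (n-1) \sum_{i=1}^n (V_{ni}^2 + R_{ni}^2 + 2 V_{ni} R_{ni}).$$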

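Let $\xi_i = \phi(X_i)$, $\zeta_i = \psi(Y_i)$, $\bar\xi = n^{-1}\sum_{i=1}^n \xi_i$ and
$\bar\zeta = n^{-1}\sum_{i=1}^n \zeta_i$. Then

$$(n-1)\sum_{i=1}^n V_{ni}^2 = (n-1)\sum_{i=1}^n \Bigl[(n-1)^{-1}\sum_{j \ne i} \xi_j - \bar\xi\Bigr]^2 + (n-1)\sum_{i=1}^n \Bigl[(n-1)^{-1}\sum_{j \ne i} \zeta_j - \bar\zeta\Bigr]^2$$

$$+ 2(n-1)\sum_{i=1}^n \Bigl[(n-1)^{-1}\sum_{j \ne i} \xi_j - \bar\xi\Bigr]\Bigl[(n-1)^{-1}\sum_{j \ne i} \zeta_j - \bar\zeta\Bigr]$$

$$= (n-1)^{-1}\sum_{i=1}^n (\xi_i - \bar\xi)^2 + (n-1)^{-1}\sum_{i=1}^n (\zeta_i - \bar\zeta)^2 + 2(n-1)^{-1}\sum_{i=1}^n (\xi_i - \bar\xi)(\zeta_i - \bar\zeta),$$

which converges a.s. to $\sigma^2$ according to the SLLN (the cross term vanishes in the limit
because the two samples are independent). From the Cauchy-Schwarz inequality, it remains to
show that

$$(n-1)\sum_{i=1}^n R_{ni}^2 \to 0 \quad \text{a.s.},$$

which is implied by

$$\max_{i \le n} |U_{ni}| = o(n^{-1}) \quad \text{a.s.} \tag{3.5}$$

and

$$\max_{i \le n} |W_{ni}| = o(n^{-1}) \quad \text{a.s.} \tag{3.6}$$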


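Since $\int \phi_n(x)\, dF_n(x) = 0$,

$$\Bigl|\int [\phi_n(x) - \phi(x)]\, d[F_{ni}(x) - F_n(x)]\Bigr| = (n-1)^{-1}\Bigl|\phi(X_i) - \phi_n(X_i) - n^{-1}\sum_{j=1}^n \phi(X_j)\Bigr|$$

$$\le (n-1)^{-1}\|\phi_n - \phi\|_\infty + [n(n-1)]^{-1}\Bigl|\sum_{j=1}^n \phi(X_j)\Bigr|,$$

and $\max_{i \le n} |\int [\phi_n(x) - \phi(x)]\, d[F_{ni}(x) - F_n(x)]| = o(n^{-1})$ a.s. follows
from $\|\phi_n - \phi\|_\infty \to 0$ a.s. (Lemma) and $n^{-1}\sum_{j=1}^n \phi(X_j) \to 0$ a.s.
(SLLN). Similarly, we can prove that

$$\max_{i \le n} \Bigl|\int [\psi_n(x) - \psi(x)]\, d[G_{ni}(x) - G_n(x)]\Bigr| = o(n^{-1}) \quad \text{a.s.}$$

Hence (3.5) holds. From the continuity of $J'$ and $\|H_{ni} - H_n\|_\infty \le n^{-1}$,

$$\max_{i \le n} \Bigl|\int \{J[H_{ni}(x)] - J[H_n(x)] - J'[H_n(x)][H_{ni}(x) - H_n(x)]\}\, dF_n(x)\Bigr| = o(n^{-1}) \quad \text{a.s.}$$

Then (3.6) follows from

$$\max_{i \le n} \Bigl|\int \{J[H_{ni}(x)] - J[H_n(x)]\}\, d[F_{ni}(x) - F_n(x)]\Bigr| = o(n^{-1}) \quad \text{a.s.} \tag{3.7}$$

Again from the continuity of $J'$, (3.7) follows from

$$\max_{i \le n} \Bigl|\int J'[H_n(x)][H_{ni}(x) - H_n(x)]\, d[F_{ni}(x) - F_n(x)]\Bigr| = o(n^{-1}) \quad \text{a.s.} \tag{3.8}$$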

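Note that

$$\Bigl|\int J'[H_n(x)][H_{ni}(x) - H_n(x)]\, d[F_{ni}(x) - F_n(x)]\Bigr| \le \|F_{ni} - F_n\|_\infty \|J'[H_n][H_{ni} - H_n]\|_V$$

$$\le n^{-1}(\|J'\|_V \|H_{ni} - H_n\|_\infty + \|J'\|_\infty \|H_{ni} - H_n\|_V).$$

Since $F_{ni}(y) - F_n(y) = (n-1)^{-1}[F_n(y) - I_{X_i}(y)]$, where $I_{X_i}(y) = I_{(X_i \le y)}$,

$$\|F_{ni} - F_n\|_V = (n-1)^{-1}\|F_n - I_{X_i}\|_V \le (n-1)^{-1}(\|F_n\|_V + \|I_{X_i}\|_V) = 2(n-1)^{-1}.$$

Similarly, $\|G_{ni} - G_n\|_V \le 2(n-1)^{-1}$ and therefore $\|H_{ni} - H_n\|_V \le 2(n-1)^{-1}$.
Hence (3.8) holds. This proves the theorem. □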
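The last step of the proof rests on how little a single deleted observation moves an empirical distribution function. The sketch below checks, in exact rational arithmetic, the identity between the leave-one-out difference and the full-sample function minus the indicator of the deleted point, divided by $n-1$. It also records the largest absolute difference, which cannot exceed $1/n$. The function names, the sample and the grid are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def ecdf_at(sample, y):
    """Empirical distribution function of `sample` at y, as an exact fraction."""
    return Fraction(sum(1 for v in sample if v <= y), len(sample))


def check_leave_one_out(sample, grid):
    """Check F_ni(y) - F_n(y) = [F_n(y) - I(X_i <= y)] / (n - 1) on a grid.

    Returns (identity_holds, largest absolute difference over i and y).
    """
    n = len(sample)
    identity_holds = True
    largest = Fraction(0)
    for i, x_i in enumerate(sample):
        reduced = sample[:i] + sample[i + 1:]  # sample without observation i
        for y in grid:
            gap = ecdf_at(reduced, y) - ecdf_at(sample, y)
            indicator = 1 if x_i <= y else 0
            if gap != (ecdf_at(sample, y) - indicator) / (n - 1):
                identity_holds = False
            largest = max(largest, abs(gap))
    return identity_holds, largest


sample = [0.3, 1.2, 2.5, 4.0]           # illustrative data, n = 4
grid = [k / 4 for k in range(0, 21)]    # y = 0, 0.25, ..., 5
holds, largest = check_leave_one_out(sample, grid)
```

Here the largest difference is attained when the largest observation is deleted and $y$ lies just below it, which gives exactly $1/n$.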


3. Comments

In some situations (e.g., $F$ is continuous), the variance of $S(F_n, G_n)$ under the null
hypothesis $H_0$ can be calculated using the theory of rank statistics (see Hájek and Šidák,
1967). Hence the critical region of the test procedure for (1.1) can be constructed by using
$s^2 = (2n)\mathrm{Var}[S(F_n,G_n) \mid H_0]$, provided
$(2n)\mathrm{Var}[S(F_n,G_n) \mid H_0] \to \sigma^2$. In these situations, the jackknife just
gives an alternative method. Computing the jackknife estimator is routine and simple and does
not require a theoretical derivation of $\mathrm{Var}[S(F_n,G_n) \mid H_0]$. Furthermore, the
consistency of the jackknife estimator holds under both null and alternative hypotheses and
therefore the jackknife may provide other statistical analysis procedures in some situations.
For example, suppose that under the alternative $H_1$, $G(x) = qF(x) + pF^2(x)$ (see Serfling,
1980, p. 293), where $p$ may or may not be known, $0 < p \le 1$ and $q = 1-p$. Suppose also
that $F$ is continuous. Then $H = F - \tfrac12 p(F - F^2)$ and

$$S(F, G) = \int_0^1 J[t - \tfrac12 p(t - t^2)]\, dt.$$

Denote this quantity by $g(p)$. Then the power of the test at $p = p_1$ is approximately

$$1 - \Phi[\Phi^{-1}(1-\alpha/2) - n^{1/2} g(p_1)/s_J] + \Phi[\Phi^{-1}(\alpha/2) - n^{1/2} g(p_1)/s_J].$$

Assume that $J$ is strictly increasing. Then $g(p)$ is strictly decreasing in $p$. If $p$ is
unknown, an approximate $100(1-\alpha)\%$ confidence interval for $p$ has limits

$$g^{-1}[S(F_n, G_n) \pm \Phi^{-1}(1-\alpha/2)\, n^{-1/2} s_J].$$
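For the Wilcoxon score the function $g$ and its inverse are available in closed form, which makes the inversion above explicit. The following worked example is a sketch that assumes $J(t) = t - \tfrac12$, $F$ continuous, and the alternative $G = qF + pF^2$ with $q = 1 - p$, so that $H = \tfrac12 F + \tfrac12 G = F - \tfrac12 p(F - F^2)$.

```latex
% Assumptions for this illustration: J(t) = t - 1/2, F continuous, G = qF + pF^2.
\begin{align*}
g(p) &= \int_0^1 \bigl[t - \tfrac12 p\,(t - t^2) - \tfrac12\bigr]\, dt
      = \tfrac12 - \tfrac12 p\,\bigl(\tfrac12 - \tfrac13\bigr) - \tfrac12
      = -\frac{p}{12}, \\
g^{-1}(s) &= -12\, s .
\end{align*}
```

In this case $g$ is linear and strictly decreasing in $p$, with $g(0) = 0$, so the confidence limits for $p$ are $-12$ times the limits for $S(F,G)$, taken in reverse order.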

Finally, the technique used in the proof of the consistency of the jackknife estimator can be
applied to general situations where $S(F,G)$ is a functional with influence functions
satisfying (3.1).